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ABSTRACT 

In this paper, we obtain the central limit theorems for LS estimator in simple linear 
errors-in-variables (EV) regression models under some mild conditions. And we also show 
that those conditions are necessary in some sense. 

1. INTRODUCTION 
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The problem for convergence control of estimator is important in practice. The central 
limit theorem plays a main role for researching large sample problems. There are numerous 
studies on the convergence in distribution and various estimation of deviation probability 
are known, see e.g. Ibragimov and Has'miniskii (1979), Ibragimov and Radavicius (1981). 
Especially, for classical maximum likelihood estimator, Ibragimov and Has'miniskii (1979) 
succeeded in proving the central limit theorem of MLE under some regularity conditions. 

In many economic applications, individual observations are very naturally categorized 
into mutually exclusive and exhaustive groups. For example, individuals can be classified 
into cohorts and workers are employees of a particular firm. The simplest grouping estimator 
involves taking the means of all variables for each group and then carrying out a group-level 
regression by OLS or weighted least squares (if there are different numbers of observations 
in different groups) . This estimator has been called the efficient Wald estimator (cf . Angrist 
(1991)). 

It is well known that there is small sample bias in the efficient Wald estimator because 
the sample group means do not equal the underlying population means. Deaton (1985) 
proposed an Errors-in- Variables (EV) estimator to correct the effect of sampling errors. 

The main purpose of this paper is to study the asymptotic normality of LS estimator for 
an EV model. For simphcity of representation, as Liu and Chen (2005), we restrict ourselves 
to the case of simple linear model: 

r]i^9 + I3xi + Si, ^i^Xi + Si, 1 <i<n (1) 

with the following assumptions: 

(1) 6, /3, xi, X2, ■ ■ ■ are unknown constants (parameters); 

(2) (£i. Si), (£2, S2), ■ ■ ■ are independent identically distributed (i.i.d.) random vectors, £1, 

£2, ■ • ■ are i.i.d., ^i, S2, ■ ■ ■ are i.i.d., and 

ESi = Eei = 0, < Var{5i) = af, Var{ei) ^ < 00; 

(3) ^i, r]i, i — 1,2, ■■ • are observable. 
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From ([T]) we have 

Tji = 9 + (3^i + Pi, Vi = ei-(35i, l<i<n. (2) 

Recently the studies for EV model have attracted much attention: see, for example, 
Gleser (1981) obtained some large sample results of estimation in a multivariate "errors in 
variables" regression model. Amemiya and Fuller (1984) discussed the estimation for the 
multivariate errors- in- variables model with estimated error covariance matrix. Cui (1997) 
proved the asymptotic normality of M-estimates in the EV model and Liu and Chen (2005) 
gave the consistency of LS estimator of linear EV regression model under rather weakly 
conditions and discovered that while in the ordinary linear regression (with the errors i.i.d., 
the error-variance is finite and non-zero) the weak, strong and quadratic-mean consistency of 
the regression estimates are equivalent, it is not so in the EV case, i.e., the quadratic-mean 
consistency requires much more stronger conditions. 

Consider formally ([2]) as a usual regression model of 77^ on ^i, we get the LS estimator of 
9 and f3 as 

A _ '^i=l{^i ~ Cn){Vi — Vn) Q _ ^ _ a ? ln\ 

where f„ = Yl^=i other similar notations, such as f]n, 5„, are defined in the same 

way. 

Under some common assumptions, Liu and Chen (2005) proved the consistency of LS 
estimators of simple linear EV model, and obtained that both weak and strong consistency 
of the estimator are equivalent, but it is not so for quadratic-mean consistency. They also 
proved that the following condition is the sufficient and necessary condition for /3„ being 
strong and weak consistent estimate of (3: 

lim n~^Sn = 00, 

n— >oo 

where Sn = J2'i=ii^i - ^nf- 
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The model to be studied can be exactly described as follows: 



r]i = 6 + (3xi + Si, = Xi + 6i,l < i < n; 

{ei,6i),l < i < riji.i.d.; (4) 
E6i = Eei = 0, E6f = af, Ee? = < af, a| < oo. 



Here {^i,'r]i), 1 < i < n are observable, while Xi,l < i < n, 6, P, af, unknown 
parameters. We are mainly interested in the convergence rate of the estimators for the 
regression coefficient (3 and constant term 6. 
Before our works, we mention a key lemma. 

Lemma 1. (Feller (1971), p530) For each n > 1, let Xn,i, ■ ■ ■ ,Xn,n be n independent vari- 
ables with distribution Fn^k, 1 < k < n. Let Tn = X]fc=i"'^n,fc- Suppose that E(X„fc) = 0, 
E(T2) = 1, and that 

n „ 

/ x^Fn,k{dx) ^ (5) 
k=i >^l^l>^ 

for each r > 0. Then the distribution of Tn tends to A^(0,1), which is a standard normal 
distribution. 

Theorem 1. Under model 0), let Sn denote Yll=i{^i ~ ^n)^ ^'^^ assume that the following 
condition satisfied: 

lim ^= = 0. (6) 

In addition, let the following conditions be fulfilled: there exists a constant a > 0, such that 
E|£i|2+" < OO, E|5i|2+" < OO and 

lim max ^'^^ = 0, (7) 

^71 

then l3n — P satisfies the asymptotic normality, i.e.. 



A Ar(o,l), 



where N{0, 1) is a standard normal distribution. 



Remarks 1. Since Sn — Yl^=i{^i ~ ^nY, and together with (6) and (7), we have 

n -C \/S^ -C max \xi — Xnf <^ Sn, (9) 

l<i<n 

where "a„ <^ 6„" denotes ^ oo, — > oo and hm = 0. This imphes that the LS 

n^oo On 

estimator possess good large-sample properties only when {xn, n> 1} has greater dispersion. 
For instance, if n > 1} is a sequence of i.i.d. random variables with common distribution 
A^(0, 1), the LS estimates have no consistency and asymptotic normality. In Section 4, we 
will show that the condition (6) is necessary in some sense. 

Proof. By simple calculation, we have 

^ / n n n \ 

Pn^P+ _ , - ^")^^ - ^ - - f3j2(^i - Sn)' ■ (10) 

For obtaining our result, we need the following lemmas. 
Lemma 2. For any r > 0, we have 

linipfSki^^M>^^ =0. (11) 
Proof. For any r > 0, because of the following relation. 



where [Sn] denote the integer part of Sn and af — E5j, we have 
p ( Etii^i^^nf > < p fE^lS- > r 

Sfi / \ V Sn 

^ -2^ V -2 y/Sn 



<p, E^jSf-al) ^ r^ /^E^M^ > ^ _ Hfl] 



=Ai + A2. 

Next we will control Ai and A2 respectively. By the law of large numbers, Ai is convergent 
to zero in probability. For enough large n, we have 

v^[v^ /r2 _2\ .„ „„ _2 . 



Since the same reasons with Ai, A2 is convergent to zero in probabihty. From the discussion 



above, we obtain the lemma. 



□ 



Lemma 3. For any r > 0, we have 

lim P ( Ei=ite-^n)' _ ^ > = 0. (12) 

Proof. In fact, we only need to prove the form P( — J 1 > r j , and the proof of 

\ On ' 

the other form — ^ 1 < — rj is similar. By simple calculati 

n n n n 



tion, we have 



i=l 



i=l 



4 = 1 



and 



Thereby, we get 



71 „ ft 



i=l 



i=l 



Hence, Lemma 3 turns to be obvious due to Lemma [2] 



Lemma 4. For any r > 0, we have 



lim P 

n— »oo 



> r = 0. 



Proof. For any r > 0, we have 



p 



Sn 



So, by Lemma [21 our result is obtained. 



Lemma 5. We have 



^Var{e^ - (36,) 



iV(0,l). 



□ 



□ 



Proof. For any n > 1 and 1 < i < n, denote 

Then for any n > 1, {^n.i}"=i is a sequence of independent random variables, and IEX„ j 
^1^=1 ^-^ni ~ Next, we shall consider the following control: for any r > 0, 

n 

^E[\Xn,i\^l{\X„,,\>r}] 
i=l 

< I V f - a:„)a;p+° 

«=1 \ VSnVar(ei-/35i) / 

where F{x) denotes the distribution of 5i — (36i. Therefore, 

n 

^E[\Xn,i\^l{\X„,,\>r}] 



i=l 

< — : , — J- max 



|2+o 



and by ([7]), we obtain 

n 

~* i=l 

From Lemma [H we obtain our lemma. 

Now we give the proof of Theorem [H By ffTOl) . we have 
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From Lemma [3] and Lemma HI it is easy to see that in probability. 



and with the same reasons, in probability 



Because of Lemma [3] and Lemma [5l we have 



Therefore, the proof of Theorem [1] is completed. 

□ 
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In this section, we shall discuss the central limit theorem of estimator of parameter 9. 
LS estimate of 9 is 

9n = f]n- PnCn. (14) 

Hence 

9n-9={p- f3n)Xn + (/? - /3n)4 " f3Sn + ^n- (15) 

Liu and Chen (2005), have proved the following theorem, which gave the sufficient and 
necessary condition for 9n being weak consistent estimate of 9. 

Theorem 2. Under model a sufficient and necessary condition for the weak consistency 
of9n is 

lim nx„(S'*)~"^ = 0, •S'* = max(n, S'n). (16) 

n— >oo 

Theorem 3. Under model 0), assume that the conditions of the TheoremUl are satisfied. 
In addition, we assume 

^ - oo, (17) 



then 9n — 9 satisfies the asymptotic normality, i.e.. 



iV(0,l). (1^ 



Remarks 2. Since nx"^ <^ Sn and the definition of Sn, we have lim — — - = 1 and f|T6l) 
satisfied. Furthermore, we notice that the normalization of f3n is related to \/S^, however, 
the normalization of 9 is related to ^/n. It is not mysterious, since the parameter (3 has 
coefficients Xi, but 9 not in EV models. And this also suggests that the proof of Theorem [3] 
might be easier than that of Theorem [H 



Proof. It is easy to see, 



n 



Hence, it is enough to show in probabihty 



-.{en-l35n)^N{Q,l). 



By Theorem [H we only need to show in probabihty, 

Xn + On) 0. 



y Sn 

The law of large numbers, and together with the condition (17), yields our results. □ 

4. FURTHER DISCUSSIONS 

Recalling that Liu and Chen (2005) proved that the following condition is the sufficient 
and necessary condition for I3n being strong and weak consistent estimate of (3: 



lim n Sn = oo. 

n—rca 

However, in Theorem [H we assume the condition ([6]), i.e., 

lim —= = 0. (19) 

It seems that the condition (I19p is strong. In this section, we will analyze it and show that 
it is necessary in some sense. 

From the proof of Lemma [21 we know that the condition (fT9|) is a technical condition to 
prove the following limit 

7^i2^i>n=^- ^20) 
k=i J 

Before our discussion, we need mention the following result. 

Theorem 4. (Petrov, 1987) Let {Xk}k>i is a sequence of independent random variables with 
the distribution Vk{x) and {a„}„>i is a sequence of increasing positive real number. Then 

k=l 

if and only if the following conditions are satisfied 



lim P 

n— »oo 



n „ 

J2 / dVkix) ^ 0; 

! 1 J\x\>a„. 



k=l 



(ii) 



2 



0; 



1 " /■ 

— ^ / xdVk{x) 0. 

j^^-^ J\x\<an 

Under our model, we can take a„ = ^/S^, = Si, k > 1 in Theorem HI If 

^"^^ k=i 

then, by (iii) of Theorem HI we have 



1 " 

Since < ¥.61 < oo, we get 



" 0. 



That is to say, (6) is the necessary condition for Theorem 1. 
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